QUONSET POINT, R.I.


To investigate the effect of the Evelyn two-dimensional night
vision trainer on scores on the Navy Radium Plaque Adaptometer
(RPA) test.


To compare with a control group the percent failing the RPA
test after training.


Following one session on the Evelyn two-dimensional night
vision trainer, a group exhibited 16.6% failures compared
to 18.4% failures in a control group on the RPA test.


When the test immediately follows training, there are fewer
failures (14.9%) than when there is a delay of one week
(19.6%).


Men who have previously passed the RPA, are then Evelyn trained
and are subsequently RPA retested still show 8.5% failures.


In a small group, 13.8% of previous passers fail the RPA
upon retest. None of these men were Evelyn trained.


While few of these findings are statistically reliable, the
observed tendencies are consistent and in the same direction
when measured by several indicators. The same tendencies are
observed even after those individuals with a score of 0/20
are arbitrarily excluded.


The number of men ordinarily failing the Navy Radium Plaque Adaptometer
Test (Device 9-B-4) might be reduced if this test were preceded by
training on the Evelyn Night Vision Trainer (Device 9-). If training
improves group performance on the night vision test, one might then
prescribe training to limit the number of failures. While the incidence
of failures could be limited arbitrarily by brightening the plaque or by
liberalizing the performance requirements on the present test, training
might insure that personnel approach the test with a more equal and
relevant background of experience. Should this be confirmed by evidence,
the test would then intrinsically be more uniform for all and performance
thereon a more significant indication of night vision ability.


EXPERIMENTAL PROCEDURE: During the course of this experiment, men who
appeared for night vision training were required also to be tested for
their night visual ability. In some cases the test immediately followed
the training, while in others the test came after a week's delay. This
permitted an estimate not only of the effect of training but of the
duration of such an effect as well. All subjects were questioned
concerning previous night vision training and testing they might have
encountered. As a result of this information, some observations are
possible concerning the effect of a previous RPA test on subsequent RPA
scores. None of the subjects had encountered any form of night vision
training prior to this experiment.


While it might be of great interest to measure the effectiveness of the
Evelyn trainer on night visual performance (and this is partly done in
the present study), the principal aim was to ascertain whether the RPA
test results might be influenced by some training procedure. Accordingly,
the percent normally failing the RPA may be taken as the standard against
which comparisons may be evaluated. Information based on the normal
testing of 4477 men at this Station was available and used for this
purpose.


Both the Evelyn training and the RPA test were performed routinely on the
men involved in this study. Adequate dark adaptation was insured by having
the subjects wear red goggles for twenty minutes and then spend ten more
minutes in complete darkness before being tested. The Evelyn training
session, which lasted about an hour, demonstrated the need for dark
adaptation, peripheral vision and scanning procedures.


In all instances, the Evelyn training preceded the night vision test and,
indeed, some men were led directly from one to the other. These constitute
our "immediately tested" group. Some other men were tested a week after
the training session.


Among our subjects were some men for whom this experiment meant that they
were taking the RPA test for the second time, and these have been treated
separately in the data.


***Thanks are due Lt.Comdr. N. B. Combs, (iC), USNR, for his cooperation
on this study.


RESULTS: The principal findings are summarized in Table 1. A total of
[illegible]48 men were tested.


The performance of these subjects may be assessed by comparing them
with the records of groups that have been routinely run through the
RPA at this Station. During the period of June 1944 to February 1945,
4477 men were tested and, of these, 18.4% failed to reach the standard
Navy performance. (Fig 1) The median score for the whole group is
18.28 correct out of 20. The failure rate approximates that reported by
the U.S. Submarine Base at New London, Connecticut, where 17.6% of men
fail in a sample of 5750 (1). The present authors have collected
evidence which suggests that the number of men failing the RPA test
varies with the seasons of the year, the significant factor probably
being exposure to sunlight; this will be the subject of a separate
report. The matter is pertinent to this report in setting up a
proper control against which the experimental effects may be estimated.
A sample of 1297 men tested here during January and February 1945,
while our data were being collected, shows that 18.7% failed. As one
would expect, this figure is not significantly different from the
average for the whole group (the chi-square test yields p = .77).


One experimental group of 434 men showed 16.6% failures on the RPA
after having been trained, instead of the normal 18.4%. Its median
score was 18.61 and this is somewhat higher, i.e., better, than 18.28
in the control group. (Fig 2) This information becomes more meaningful
if analyzed to reveal the influence of a time delay between training
and the test. When the test immediately follows the training
session, as it did for 276 men, there are 14.9% failures and a median
of 18.69. After a delay of one week for 153 men, the failures rise
to 19.6% and the median drops to 18.44. None of the percent deviations
from the control measures are statistically significant by the
chi-square test, though the difference achieved by the immediately
tested group approaches this with a "p" equal to .15. Such an indication
may be taken to mean that if there are any effects of training as can be
measured by the RPA, these are short-lived indeed.
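
The chi-square comparisons against the control group can be roughly
checked from the figures quoted in the text. What follows is only a
minimal sketch: it assumes a Pearson chi-square on a 2x2 table (group
by pass/fail) with one degree of freedom and no continuity correction,
which the report does not state. The failure counts are reconstructed
from the rounded percentages, and the function name is hypothetical.

```python
import math

def chi_square_2x2(fail_a, n_a, fail_b, n_b):
    """Pearson chi-square (1 d.f., no continuity correction) for two groups.

    Returns (chi2, p). With 1 d.f. the upper tail is erfc(sqrt(chi2 / 2)).
    """
    observed = [[fail_a, n_a - fail_a], [fail_b, n_b - fail_b]]
    rows = [n_a, n_b]
    cols = [fail_a + fail_b, (n_a - fail_a) + (n_b - fail_b)]
    total = n_a + n_b
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = rows[i] * cols[j] / total
            chi2 += (observed[i][j] - expected) ** 2 / expected
    return chi2, math.erfc(math.sqrt(chi2 / 2))

# Assumption: counts rebuilt from the percentages given in the text.
control_n = 4477
control_fail = round(0.184 * control_n)

groups = {
    "whole trained group": (round(0.166 * 434), 434),
    "tested immediately": (round(0.149 * 276), 276),
}
results = {}
for name, (fails, n) in groups.items():
    chi2, p = chi_square_2x2(fails, n, control_fail, control_n)
    results[name] = p
    print(f"{name}: chi2 = {chi2:.2f}, p = {p:.2f}")
```

Under these assumptions the immediately tested group gives a p value
close to the .15 quoted above; exact agreement should not be expected,
since the original counts and procedure are not given.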


By far, the best performance was turned in by a group of 154 men, who,
prior to this experiment, had once passed the RPA test. When these men
were tested again, in this case immediately after the Evelyn training,
only 8.3% failed and the median score is 19.03; chi-square shows the
difference to be reliable with a "p" equal to .01. This drop in the
number of failures may be due to the RPA retest as such, to the
selection of previous passers, or to the Evelyn training. A suggestion
as to the relative involvement of these factors may be gained by
inspecting our final group of 80 men, all previous RPA test "passes", who
were retested though not Evelyn trained. Here there are 13.8% failures, a
drop from the 18.4% normal group as substantial in magnitude as that due
to training (14.9% for one sub-group, 16.6% for the entire group). With
both groups identical in all respects except for the matter of Evelyn
training, the difference between 13.8% and 8.3% failures may well be an
indication of the influence of Evelyn training, while the difference
between 18.4% for the normal group and 13.8% for the retest group may
be partly due to selection and partly due to the retest. By far the more
disturbing fact, however, is that 13.8% of a group that once passed the
RPA may fail on retest.


As reported above, the variations in both the percent failing and the
median score may be generally interpreted to point to the same tendency.
To a lesser extent, this is true as well for other indications of
performance that may be found in the percent achieving a perfect score
(10/10) and in the percent failing to see anything at all (0/20).
(Fig. 3) Cumulative percent curves in Fig. 4 also show the separation
between the several groups.


DISCUSSION: 


Since few of the observed differences in percent failing are reliably
differentiated from the control measures, some caution must be exercised
in interpreting these results. Not only are the samples relatively small,
but the RPA, as an experimental indicator, does not mirror all the
functions one is likely to train in night vision. One might also prefer
the RPA to have a higher test-retest reliability. However, there are
certain clear tendencies in the data, and another investigation with
larger groups might yield not only direct verification but even
statistical reliability for differences of the same magnitude.


With these cautions in mind, one may assert that Evelyn training only
slightly (1.8%) diminishes the incidence of failing the RPA, though the
effect is larger (3.5%) when the test immediately follows training. This
finding is a disappointment should one propose use of the Evelyn to reduce
sharply the number of failures. A broader view may be that the substance
of Evelyn training can improve the lower threshold of night vision; one
may assume that the effective carry-over to the test is some knowledge
about peripheral fixation and possibly a greater motivation to succeed. A
single session in the two-dimensional trainer scarcely affords much
practice in scanning or peripheral fixation, and additional training may
be necessary to instill these as working habits. As measured on the RPA,
the effect of training wears off rapidly, which fits in with these
observations. Training more likely to decrease RPA failures would have to
provide experience relevant to the test, and this information already
exists. Thus, the New London Sub Base reports that when failures are
retested, only 5% of the population still fail after three tests. (1)


It may well be that the performance required on the RPA is so simple and 
basic that, except for the matter of fixation, it is only relatively sus- 
ceptible to improvement by training. For this purpose, direct practice in 
fixation, repeating the test and greater emphasis on fixation by the test 
operators would result in a more uniform performance. By the same fact, 
however, one should not expect it to give the best all-around prediction 
of night lookout ability.


The question as to just what the two and three dimensional trainers
achieve, except for demonstration, remains unanswered and is not properly
the concern of this study. However, such an investigation is imperative
for testing the present effectiveness of the devices, for developing an
inclusive curriculum, for establishing practice procedures, and for
instituting a level of competence which personnel must reach before being
qualified for night lookout duties.


Indeed, the fewest failures (8.3%) are among those who, having previously
passed the RPA, have now been trained and retested. Selection, training
and retest all contribute to the reduction, the intermingling of which may
be partly judged by the percent of passers who fail on retest alone (13.8%)
without benefit of training. One possible flaw in these data is that it was
impossible to check in all cases the subjects' claim of a previous passing
performance on the RPA. Let us note, at least, that the group one would
expect to be most favored, with selection, training and retest, still
exhibits 8.5% failures. Evelyn training an unselected group could hardly
reduce beyond that figure the percentage of failures on the first RPA.


On a purely statistical basis the evaluation of these results must be
tempered with caution. Five out of twenty-one different possible scores
constitute a passing performance, while the remaining sixteen are called
failures. The resulting frequency distribution is hardly normal and this
is a function, mainly, of the failure to discriminate effectively among
those who pass (approximately 82% of the population). To a lesser extent,
another contribution to distortion of the curve is made by a segment of
the population (approximately 8%) who probably feign night blindness and
get scores of 0/20. This casts doubt over any statistical procedure based
on the expectation of a normal distribution.


Since none of the observed effects in this study may be said to be unduly
large, the existence of a large group of individuals who fail in the 0/20
category, i.e., by not guessing right on the test even once in 20 trials,
constitutes a hazard for any interpretation. On a chance basis, there are
three ways out of four that an individual may be wrong on any single trial
and about 3 chances in 1000 that this could occur on 20 straight trials.
Something other than chance must be operating when 8% rather than an
expected .3% of the group get a score of 0/20. Men examined on the RPA may
feel that a failure on the test will save them from going to sea and that
there is an advantage in not being able to see anything at all. The method
for achieving this is naive when the individual for 20 trials calls the
letter "T" in a position directly opposite from that in which it is
placed. Charting the dark adaptation curve or scotopic perimetry might
detect the fraudulent. The presence, however, of such a group in an
experiment might easily mean that the observed results are due less to the
influences being studied than to the capricious nature of these examinees.
In the 9 months during which the RPA has been employed at this station, the
number of men getting this score has varied from 0 to 12.0%, with a mean
of 8.5% for the total group.
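
The chance expectation for a 0/20 score can be worked out directly. The
sketch below assumes that each trial is an independent guess among four
equally likely positions (three of them wrong) and applies the result to
the control group of 4477 men with 8.5% scoring 0/20; the variable names
are illustrative only.

```python
# Assumption: 3 of the 4 possible responses are wrong on any one trial,
# and the 20 trials are independent.
p_wrong = 3 / 4
trials = 20

p_all_wrong = p_wrong ** trials        # probability of 0/20 by pure guessing
per_thousand = 1000 * p_all_wrong

# Control group figures taken from the text.
n_control = 4477
expected_by_chance = n_control * p_all_wrong
observed = 0.085 * n_control

print(f"P(0/20 by chance) = {p_all_wrong:.4f} (about {per_thousand:.1f} per 1000)")
print(f"expected by chance: {expected_by_chance:.0f} men")
print(f"observed: about {observed:.0f} men")
```

On these assumptions the chance rate is roughly 0.3%, so the observed
frequency of 0/20 scores exceeds the chance expectation many times over.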


We assumed, therefore, that most of these men were probably feigning night
blindness and desired to evaluate the data in a way to be free of any
influence these men might contribute. A limit of 2% failing completely
(0/20) was established, and those making up more than that percent of the
population
were arbitrarily excluded from the data, which was then reworked
in Table 2. The main finding is that the tendencies originally
shown remain and we may, with some confidence, assert that the data
are not an artifact due entirely to the chance composition of the 0/20
group.
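
The 2% limit can be illustrated with a short calculation. This is a
sketch under assumptions, not the authors' worked procedure: it supposes
that 0/20 scorers were removed one at a time until they made up no more
than 2% of the men remaining, and it rebuilds the control-group count of
0/20 scorers from the rounded 8.5% figure. The function name is
hypothetical.

```python
def zero_scorers_to_exclude(n_total, n_zero, limit=0.02):
    """Smallest number of 0/20 scorers to drop so that those left
    are at most `limit` of the remaining group."""
    excluded = 0
    while n_zero - excluded > limit * (n_total - excluded):
        excluded += 1
    return excluded

# Control group: 4477 men, 8.5% at 0/20 (count reconstructed, an assumption).
n_total = 4477
n_zero = round(0.085 * n_total)

dropped = zero_scorers_to_exclude(n_total, n_zero)
remaining = n_total - dropped
share = (n_zero - dropped) / remaining
print(f"drop {dropped}, leaving N = {remaining} with {100 * share:.1f}% at 0/20")
```

Because the input count comes from a rounded percentage, the resulting N
only approximates the 4188 shown for the control group in Table II.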
TABLE I


Summary of Data


                                                                          P value*,
                                    %       Median    % 10/10   % 0/20    chi-square test
Group                       N       Fail    Score     scores    scores    against control


Control - Quonset Pt.       4477    18.4    18.28     41.4      8.5

Evelyn trained, RPA
tested, whole group         434     16.6    18.61     45.2      7.6       .35

Evelyn trained,
tested immediately          276     14.9    18.69     [illeg.]  [illeg.]  .15

Evelyn trained, tested
one week later              153     19.6    18.44     [illeg.]  [illeg.]  .65

Previous RPA pass,
Evelyn trained,
RPA retest                  154     8.3     19.03     [illeg.]  [illeg.]  .01

No Evelyn training,
RPA test and retest         80      13.8    [illeg.]  56.3      7.9       [illeg.]


* When this value reaches .05 or less, then differences as large as
this due to chance factors could have arisen only 5 times in 100
and the difference may then be considered reliable.
TABLE II


Summary of Data


Recalculated after limiting failures
in the 0/20 category to 2% of the population


                                                                          P value,
                                    %         Median    % 10/10   % 0/20  chi-square test
Group                       N       Fail      Score     scores    scores  against control


Control - Quonset Pt.       4188    [illeg.]  18.55     44.2      [illeg.]

Evelyn trained, RPA
tested, whole group         410     11.7      18.83     47.8      2.0       .88

Evelyn trained,
tested immediately          260     9.6       18.92     48.9      [illeg.]  [illeg.]

Evelyn trained, tested
one week later              [illeg.] [illeg.] [illeg.]  [illeg.]  [illeg.]  .20

Previous RPA pass,
Evelyn trained,
RPA retest                  130     5.4       19.06     53.1      [illeg.]  [illeg.]

No Evelyn training,
RPA test and retest         78      [illeg.]  [illeg.]  [illeg.]  [illeg.]  1.00